Claims

[c1] 1. A method for automatically generating a query, comprising:

defining an organized classification of document content with each class in the organized 
classification of document content having associated therewith a classification label; each 
classification label corresponding to a category of information in an information retrieval 
system; 

identifying a set of entities in selected document content for searching additional 
information related thereto using the information retrieval system; 
assigning the selected document content a classification label from the organized 
classification of content; 

automatically formulating a query that restricts a search at the information retrieval
system for information concerning the set of entities to the category of information in the
information retrieval system identified by the assigned classification label.
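
For illustration only, the steps of claim 1 might be sketched as follows. The label-to-category table, the `category:` restriction syntax, and the names `CLASSIFICATION`, `Query`, and `formulate_query` are all hypothetical; the claim does not prescribe any particular query syntax or data structure.

```python
from dataclasses import dataclass

# Hypothetical organized classification: each classification label maps to a
# category of information in the retrieval system. These entries are invented.
CLASSIFICATION = {
    "science": "Science",
    "finance": "Business/Finance",
}


@dataclass(frozen=True)
class Query:
    entities: tuple
    category: str

    def to_string(self) -> str:
        # Quote each entity; the "category:" prefix is an assumed syntax.
        quoted = " ".join(f'"{e}"' for e in self.entities)
        return f'{quoted} category:"{self.category}"'


def formulate_query(entities, label, classification=CLASSIFICATION):
    """Restrict a search for `entities` to the category named by `label`."""
    if label not in classification:
        raise KeyError(f"unknown classification label: {label}")
    return Query(tuple(entities), classification[label])
```

Under these assumptions, `formulate_query(["Xerox", "PARC"], "science").to_string()` yields a query string that names both entities and restricts the search to the `Science` category.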

[c2] 2. The method according to claim 1, further comprising limiting the query by adding
terms relating to context information surrounding the set of entities in the selected
document content.

[c3] 3. The method according to claim 2, wherein the number of terms added is limited to a
predefined number.
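
One possible reading of claims 2 and 3 is sketched below: words near an entity are collected as context terms, and the number added to the query is capped. The token window, the stop list, and the names `context_terms` and `limit_query` are assumptions made for this sketch and are not taken from the claims.

```python
import re

# Small illustrative stop list; a real system would use a fuller one.
STOP_WORDS = {"the", "a", "an", "of", "in", "and", "to", "is", "at", "on"}


def context_terms(text, entity, window=5, max_terms=3):
    """Collect up to `max_terms` distinct words found within `window` tokens
    of each occurrence of a single-word `entity` in `text`."""
    tokens = re.findall(r"[A-Za-z0-9]+", text.lower())
    target = entity.lower()
    terms = []
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo, hi = max(0, i - window), i + window + 1
        for near in tokens[lo:hi]:
            if near != target and near not in STOP_WORDS and near not in terms:
                terms.append(near)
    # Claim 3: cap the number of added terms at a predefined number.
    return terms[:max_terms]


def limit_query(query, terms):
    """Append context terms to an existing query string."""
    return " ".join([query, *terms])
```

Appending terms narrows the query because, in most retrieval systems, additional terms must also be matched or at least influence ranking; which of the two applies depends on the system and is not specified here.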

[c4] 4. The method according to claim 2, further comprising limiting the query by adding
terms defining the assigned classification label.

[c5] 5. The method according to claim 1, wherein the organized classification of document
content is defined using a hierarchical organization.
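
As a rough illustration of a hierarchical organization, the classification could be held as a nested mapping in which each label has a root-to-label path. The tree contents and the name `label_path` are hypothetical and serve only to make the idea concrete.

```python
# Hypothetical hierarchy: each label maps to a dict of its child labels.
HIERARCHY = {
    "Science": {"Physics": {}, "Biology": {"Genetics": {}}},
    "Business": {"Finance": {}},
}


def label_path(label, tree=HIERARCHY, prefix=()):
    """Return the root-to-label path as a tuple, or None if the label is absent."""
    for name, children in tree.items():
        path = prefix + (name,)
        if name == label:
            return path
        found = label_path(label, children, path)
        if found is not None:
            return found
    return None
```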

[c6] 6. The method according to claim 1, further comprising using a text categorizer to assign
the classification label assigned from the organized classification of content.

[c7] 7. The method according to claim 6, further comprising: 

extracting with the text categorizer a set of terms relating to the document content; and 
appending to the query ones of the set of terms extracted by the text categorizer to 
contextualize the query. 

[c8] 8. The method according to claim 7, further comprising abbreviating the set of terms
extracted by the text categorizer to a predefined limit of terms.

[c9] 9. The method according to claim 8, wherein said abbreviating comprises:

extracting noun phrases from the selected document content; 

ranking the noun phrases by those that occur most frequently in the document content; 
defining a subset of noun phrases by identifying those ranked noun phrases that occur 
more frequently than a first predefined frequency; 

ranking those words in the subset of noun phrases by their frequency of occurrence to 
define an ordered list of words; 

defining a subset of the ordered list of words by identifying those ranked words that
occur more frequently than a second predefined frequency;

re-ranking the subset of words in inverse frequency to their use in the category of
information in the information retrieval system identified by the assigned classification
label;

using only those highest ranked words in the re-ranked subset of words to define the set
of terms appended to the query.
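
The abbreviating steps of claim 9 can be sketched as below, under several stated assumptions: noun-phrase extraction is performed elsewhere and supplied as a list with one entry per occurrence; word frequency is counted over the retained noun phrases, weighted by phrase occurrences; and `category_freq` is a hypothetical table of word frequencies in the assigned category. The function name and default thresholds are illustrative only.

```python
from collections import Counter


def abbreviate_terms(noun_phrases, category_freq, min_phrase_freq=1,
                     min_word_freq=1, limit=3):
    """Reduce noun-phrase occurrences to at most `limit` query terms."""
    # Rank noun phrases by how often they occur in the document content.
    phrase_counts = Counter(p.lower() for p in noun_phrases)
    # Keep phrases occurring more often than the first predefined frequency.
    frequent = [(p, n) for p, n in phrase_counts.most_common()
                if n > min_phrase_freq]
    # Rank the words of the retained phrases by frequency of occurrence
    # (assumption: each word is weighted by its phrase's occurrence count).
    word_counts = Counter()
    for phrase, n in frequent:
        for word in phrase.split():
            word_counts[word] += n
    # Keep words occurring more often than the second predefined frequency.
    words = [w for w, n in word_counts.most_common() if n > min_word_freq]
    # Re-rank in inverse frequency to use in the category: words that are
    # rarer in the category come first. The sort is stable, so ties keep
    # their document-frequency order.
    words.sort(key=lambda w: category_freq.get(w, 0))
    # Use only the highest-ranked words.
    return words[:limit]
```

Ranking rarer category words first favors terms that distinguish the document within its category over terms that nearly every document in that category shares.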

[c10] 10. The method according to claim 1, wherein each class in the organized classification
of document content has associated therewith a characteristic vocabulary.

[c11] 11. The method according to claim 10, further comprising ranking results from the query
performed at the information retrieval system in accordance with one of the assigned
classification label and the characteristic vocabulary.
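
A minimal sketch of ranking results against a characteristic vocabulary follows. Scoring by the number of distinct vocabulary terms a result contains is an assumption chosen for simplicity; the claim does not state a scoring rule, and `rank_results` is a hypothetical name.

```python
import re


def rank_results(results, vocabulary):
    """Order result texts by how many distinct vocabulary terms each contains."""
    vocab = {term.lower() for term in vocabulary}

    def score(text):
        return len(vocab & set(re.findall(r"[a-z0-9]+", text.lower())))

    # sorted() is stable, so results with equal scores keep their original order.
    return sorted(results, key=score, reverse=True)
```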

[c12] 12. The method according to claim 11, using the method in a system for enriching
selected content of a document with personalities that identify enrichment themes.

[c13] 13. The method according to claim 1, further comprising automatically identifying the set
of entities using a service that recognizes entities of a predefined type.

[c14] 14. A system for automatically generating a query, comprising:

an entity extractor for identifying a set of entities in selected document content for
searching information related thereto using an information retrieval system;

a categorizer for defining an organized classification of document content with each class
in the organization of content having associated therewith a classification label; each
classification label corresponding to a category of information in the information retrieval
system; the categorizer assigning the selected document content a classification label
from the organized classification of content;

a query generator for automatically formulating a query that restricts a search at the
information retrieval system for information concerning the set of entities to the category
of information in the information retrieval system identified by the assigned classification
label.

[c15] 15. The system according to claim 14, further comprising a short length aspect vector

generator for generating terms relating to context information surrounding the set of 
entities in the selected document content; wherein the query generator adds the terms 
relating to the context information to limit the query. 

[c16] 16. The system according to claim 15, wherein the query generator further limits the
query by adding terms defining the selected classification label provided by the
categorizer.

[c17] 17. The system according to claim 16, further comprising a content manager for
enriching the selected document content with results provided from the information
retrieval system using the query.

[c18] 18. An article of manufacture for use in a computer system, comprising:

a memory;

instructions stored in the memory for operating a method for automatically generating a
query, comprising:

defining an organized classification of document content with each class in the organized 
classification of document content having associated therewith a classification label; each 
classification label corresponding to a category of information in an information retrieval 
system; 

identifying a set of entities in selected document content for searching information 
related thereto using the information retrieval system; 

assigning the selected document content a classification label from the organized 
classification of content; 

automatically formulating a query that restricts a search at the information retrieval 
system for information concerning the set of entities to the category of information in the 
information retrieval system identified by the assigned classification label. 

[c19] 19. The article of manufacture according to claim 18, wherein the instructions stored in
the memory further comprise limiting the query by adding terms relating to context
information surrounding the set of entities in the selected document content.

[c20] 20. The article of manufacture according to claim 19, wherein the instructions stored in
the memory further comprise further limiting the query by adding terms defining the
assigned classification label.